A style control technique for singing voice synthesis based on multiple-regression HSMM

Authors

Takashi Nose

Misa Kanemoto

Tomoki Koriyama

Takao Kobayashi

Abstract

This paper proposes a technique for controlling singing style in HMM-based singing voice synthesis. A style control technique based on the multiple-regression HSMM (MRHSMM), originally proposed for HMM-based expressive speech synthesis, is applied to the conventional singing voice synthesis technique. Pitch adaptive training is introduced into the MRHSMM to improve the modeling accuracy of the fundamental frequency (F0) associated with notes. A robust vibrato modeling technique based on a moving average filter is also proposed to reproduce natural-sounding vibrato even when the vibrato in the original singing voice is unclear. Subjective evaluation results show that users can intuitively control the singing style while preserving the naturalness of the synthetic voice.
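As a rough illustration of the multiple-regression idea behind MRHSMM-style control, the sketch below computes a state's mean vector as a linear function of a style vector, mu = H [1, v], which is how multiple regression on a control vector is usually written. The function name `regression_mean`, the matrix `H`, and all numeric values are hypothetical and chosen only for illustration; they are not taken from the paper, and training, duration modeling, pitch adaptive training, and vibrato modeling are not covered.

```python
from typing import List, Sequence


def regression_mean(H: Sequence[Sequence[float]], style: Sequence[float]) -> List[float]:
    """Return mu = H * [1, v_1, ..., v_L] for a style vector v (illustrative only)."""
    # Augment the style vector with a leading 1 so the first column of H acts as a bias.
    xi = [1.0, *style]
    for row in H:
        if len(row) != len(xi):
            raise ValueError("each row of H needs len(style) + 1 entries")
    # Plain matrix-vector product, one output dimension per row of H.
    return [sum(h * x for h, x in zip(row, xi)) for row in H]


# Hypothetical 2-dimensional feature with a single style axis.
H = [[5.0, 0.5],
     [1.0, -0.25]]

neutral = regression_mean(H, [0.0])  # style intensity 0: only the bias column contributes
strong = regression_mean(H, [1.0])   # style intensity 1: bias plus the full style column
```

Under this assumption, changing the style vector at synthesis time shifts the mean continuously, which is what makes intermediate or exaggerated style intensities possible.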

Related resources

A technique for controlling voice quality of synthetic speech using multiple regression HSMM

This paper describes a technique for controlling voice quality of synthetic speech using multiple regression hidden semi-Markov model (HSMM). In the technique, we assume that the mean vectors of output and state duration distribution of HSMM are modeled by multiple regression with a parameter vector called voice quality control vector. We first choose three features for controlling voice qualit...

A style control technique for speech synthesis using multiple regression HSMM

This paper presents a technique for intuitively controlling the degree or intensity of speaking styles and emotional expressions of synthetic speech. The conventional style control technique based on multiple regression HMM (MRHMM) has the problem that it is difficult to control the phone duration of synthetic speech, because the HMM has no explicit parameter that models phone duration appropriately. To ...

Factored maximum likelihood kernelized regression for HMM-based singing voice synthesis

In our previous work, we proposed factored maximum likelihood linear regression (FMLLR) adaptation where each MLLR parameter is defined as a function of a control vector. In this paper, we introduce a novel technique called factored maximum likelihood kernelized regression (FMLKR) for HMM-based style adaptive speech synthesis. In FMLKR, nonlinear regression between the mean vector of the base mo...

Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis

This paper describes a style adaptation technique using hidden semi-Markov model (HSMM) based maximum likelihood linear regression (MLLR). The HSMM-based MLLR technique can estimate regression matrices for affine transform of mean vectors of output and state duration distributions which maximize likelihood of adaptation data using EM algorithm. In this study, we apply this adaptation technique ...

A Perceptual Expressivity Modeling Technique for Speech Synthesis Based on Multiple-Regression HSMM

This paper describes a technique for modeling and controlling emotional expressivity of speech in HMM-based speech synthesis. A problem of conventional emotional speech synthesis based on HMM is that the intensity of an emotional expression appearing in synthetic speech completely depends on the database used for model training. To take into account the emotional expressivity that listeners act...

Publication date: 2013
